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A general proof that more energy flows upscale than downscale in two-dimensional (2D) 
turbulence and barotropic quasi-geostrophic (QG) turbulence is given. A proof is also 
given that in Surface QG turbulence, the reverse is true. Though some of these results 
are known in restricted cases, the proofs given here are pedagogically simpler, require 
fewer assumptions and apply to both forced and unforced cases. 



1. Introduction 

It is a well-known result that energy is transferred by nonlinear wave- wave interaction 
predominantly upscale in two-dimensional homogeneous and isotropic (2D) turbulence, 
and in quasi-geostrophic (QG) turbulence (see ISalmonl jj.998)). What is less commonly 
known is the fact that, except for certain special cases, a general unified proof spanning 
both the forced-dissipative and the decaying cases is not yet available. Furthermore, it 
is also not widely appreciated that these results cannot be readily extended to models of 
quasi-geostrophic turbul ence. 

It was recognized by iFicirtafd l|l953l) and iGna.rnevI l|l97lh that the direction of net 
energy transfer in 2D and QG turbulence may be different from that for 3D isotropic 
and homogeneous turbulence and that the cause for this different behavior should be 
attributed to the f ormer's twin conservation of en ergy and enstrophy. Ho wever, as pointed 
out previously bv iMerilees fc Warn! J 19751) and iTung fc Welch! l|200ll) . the analysis of 
triadic transfers by Fiortoft (195J3) and iGharne v (1971) was flawed. These proofs made 
use of the simultaneous conservation of energy and enstrophy in 2D and QG turbulence, 
and the fact that enstrophy spectrum G{k) is related to the energy spectrum E{k) by 
G(k) = k 2 E(k). They claimed to have shown that if a unit of energy is moved downscale, 
many more units of it have to be moved upscale in order to preserve the twin energy and 
enstrophy conservation. However, the direction of energy flow in time cannot and should 
not be determined by conservational considerations alone. Either an essential use of the 
dissipation terms has to be made to set the direction of the time arrow, or a constraining 
assumption has to be introduced on the initial condition to employ a Boltzmann-type 
argument. 

In h is paper, iFiOrtoftJ l)l953l) gives two distinct proofs. The first proof by iFiOrtoftl 
(1953) does show that the only admissible triad interactions are those that spread energy 
from the middle wavenumber to the outer wavenumbers (and vice versa, the ones that 
bring in energy to the central wavenumber fr om the outer wavenumbers). These are the 
triad interactions defined by IWaleffd l)l992f) as class "R" . An alternative set of triad 
interactions are the ones where energy is transfered from the smallest wavenumber to 
the two largest ones; these are the class "F" triad interactions, and they are dominant 
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in three-dimensional turbulence. Fj0rt0ft's proof can be empl oyed to rule these out in 
two-dimensional turbulence. However, as was pointed out by iMerilees fc l|l97fih 
there exist also class "R" triad interactions that transfer more energy downscale than 
upscale. Thus, eliminating the class "F" interactions is not sufficient to constrain the 
direction of the energy flux or the enstrophy flux . Despite this proble m, Fj0rt0ft's p roof 
has been popularized in textbooks l|Sarmonlll998j) and review articles l)Tabelindl2002j) as 
a rigorous proof that constrains the direction of the fluxes in two-dimensional turbulence, 
thereby becoming a b it of a misunde rstood "folklore" argument. 

The second result of lFidrtafd ill 953h is an upper bound on the total energy accumulated 
on wavenumbers larger than some given k. This result however applies only to initial 
value problems without forc ing, where ener gy has to be bounded, unsurprisingly. This 
inequality was later taken bv lCharnevI (Il97lh as a proof that energy cannot go downscale, 
since the energy E >k (t) accumulated at wavenumbers larger than k is bounded by 

E >k (t) < lc(t) < 1(3(0) (1.1) 

where G(t) is the total enstrophy at tim e t. Thus, the energy sp ectrum E(k, t) is bounded 
by E(k, t) < ck~ 3 for some constant c. iTung fc Welcbl (2001) pointed out that this be- 
havior of the energy spectrum is merely a consequence of the requirement for convergence 
of the Fourier representation of the enstrophy spectrum G(fc), which implies that G(fc) 
must decay faster than fc _1 as k —* oo. Therefore the energy spectrum E(k) must decay 
faster than k~ 3 as k — > oo. It says nothing about the direction of energy cascade, thus 
it does not help Fj0rt0ft's "proof" in the first half of the paper. On the other hand, 
equation (|3.18l) of o ur pa per, which involves fluxes, is derived via the same mathematical 
trick that first used to derive his inequality, which involves s pectra. 

The deeper conceptual difficulty with Fj0rt0ft's result was also recognized bv lKraichnanl 
(1967) in section 3 of his paper. As is well-known, the Euler equation is invariant with re- 
spect to time reversal, and as Kraichnan himself has observed, the direction of the energy 
and enstrophy flux can be reversed simply "by reversing the velocity field everywhere in 
space" . In other words, for every initial condition where the fluxes go one way, there is an- 
other initial condition where they go the opposite way. This conundrum is similar to the 
situation in statistical mechanics where experience suggests that entropy-increasing solu- 
tions are statistically more probable than the entropy-decreasing solutions even though 
the underlying dynamical system is symmetric under time reversal. The arguments used 
to resolve this apparent paradox involve selecting a class of initial conditions which give 
entropy increasing solutions, and arguing in some way that t he initial conditio ns have 
to be members of that class in order to be physically realistic. iKraichnanl l|l967|) tries to 
define the direction of the time arrow by assuming that the cascade energy spectrum has 
an "urge" to go towards the energy spectrum corresponding to absolute thermodynamic 
equilibrium, which implicitly assumes the vali dity of a Boltzmann-t ype thermodynamic 
argument. Except for an interesting proof bv iRhinesl l|l975l fl9 79). which will be dis- 
cussed next, we are not aware of any convincing such argument that decides that the 
direction of the fluxes for the case of the Euler equation points toward thermodynamic 
equilibrium. On the other hand, because the Navier-Stokes equations are dissipative and 
more realistic than the Euler equation, a proof for the Navier-Stokes equation is more 
relevant. As we shall see, in the forced-dissipative case, the direction of the time arrow 
is decided by the dissipation terms of the Navier-Stokes equations without requiring any 
further assumptions. 

The Rhines proof is ap plicable to the case of unforced decaying turbulence l|Rhinesl 
Il975l Il979t ISalmonl ll998'l. In his argument, Rhines begins with the assumption that 
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an initial peak of energy in the energy spectrum has the tendency to spread out. This 
assumption constrains the set of initial conditions and thus defines the direction of the 
time arrow. Then, he shows that the energy weighted wavenumber, which represents the 
average location of the peak, will decrease in time therefore moving the peak to larger 
length scales. From this, he argues that the energy therefore has a tendency to go upscale. 
Although Rhincs originally intende d the proof to apply to both the viscous case and the 
inviscid case, there was an error in iRhinei {.1979), where the dissipation of energy was 
ignored while that of the enstrophy was kept. Consequently the pro of, in i t s pub lished 
form, is valid only for the inviscid case. This problem was remedied bv lScottl l|200l|) . who 
gave a corrected proof for the case of molecular diffusion and Ekman damping. In the 
present paper we will extend the proof to the case of hypcrdiffusion. However, we will 
also show that, curiously enough, the proof does not work for the case of hypodiffusion. 

It should be emphasized that Rhines' proof derives a statement involving the time 
derivative of the global integral of a quantity involving the energy spectrum. As such, it 
establishes a global tendency for the energy spectrum as a whole to shift toward smaller 
wavenumbers. However, it would be incorrect to draw conclusions on the behavior of 
the energy flux at local intervals of wavenumbers from a global result. For example, 
one cannot conclude from this proof that the energy flux on the downscale side of the 
forcing range goes upscale. Furthermore, because the scope of the proof is confined to 
the decaying problem, one can draw no conclusions on the direction of the energy flux 
from this proof for the forc ed-dissipative case. 

A nice proof was given bv lEvinkl l)l99fil) for the forced-dissipative cas e in section 3.1.1 of 
his pa p er (see eqs. (3.2 ) to ( 3.8)). Similar proofs have also been given bv lTran k Snenherdl 
l(2004UDanilovf l|2005h and lOkioulekas k Tund i|2005dY The advantage of this proof is 
that it allows the dissipation terms to decide the direction of the time arrow. Further- 
more, it directly considers the behaviour of the energy flux instead of inferring it from 
the time-derivatives of the energy spectrum. On the other hand, it cannot be easily 
extended to the case of decaying turbulence. Furthermore, this proof also requires an 
assumption: it requires that inertial ranges exist and that there is a separation of scales 
between the upscale and downscale dissipation scale and the forcing scale. These assump- 
tions are well supported by numerical simulations llBoffetta. Celani k VergassolalEoba 
llshihira k Kanedall200ll iLindborg k AlveliudEoOfi IPasciuero k Falkovicbll2002|) . How- 
ever, unlike the case of three-dimensional turbulence where the energy cascade is very 
robust, in two-dimensional turbulence there are si tu ations where the inertial rang es do 
not exist l|Danilovll2005t iDanilov fc Guraridl2001 Jl iTran k Bowmanll2003L EoO^) . 

In the remainder of his paper, lEvinkl l|l996l) shows that the underlying assumptions 
on the existence of inertial ranges can be rigorously reduced to the hypothesis t hat the 
total e nergy of the system remains finite in the limit v — > + (see Hypothesis 1 in lEvinkl 
l|l99fil) ). As was pointed out to us by an anonymous referee, from a physical point of view, 
this hypothesis serves to rule out the possibility that the energy injected at the forcing 
range will simply pile up in the spectral neighborhood of the injection and di verge to 
infinity in th e limit v — > + . Thus, it is similar, in meaning, to the hypothesis of R hinesl 
1979) that instead o f piling up at forcing, the injected energy will want to spread 
out. From this hypothesis. lEvinkl l|199fil) shows that there exist regions of constant flux 
both upscale and downscale of the forcing scale and derives bounds on the location of the 
dissipation scales that prove a separation of scales between the dissipation scales and the 
forcing scale. It should be noted that the presence of a constant flux does not guarantee 
that a local cascade of energy and enstrophy is the dominant effect. A constant flux may 
also result from non-local transfer that takes energy and enstrophy directly from the 
forcing scale to the dissipation scales. 
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In the present paper we will use the energy flux approach and derive an inequality 
for the general case that shows that the weighted average of the energy flux is negative 
and the weighted average of the enstrophy flux is positive. The averages involved are 
such that the inequalities can be satisfied only when most of the energy goes upscale 
and most of the enstrophy goes downscale. For example, the energy flux inequality gives 
more weight to large wavenumbers than small wavenumbers. Consequently, the upscale 
energy flux at small wavenumbers must be significantly larger than the downscale flux 
at large wavenumbers to make the average come out negative. A similar consideration 
applies to the enstrophy flux inequality. 

What is remarkable is that in the forced-dissipative case these inequalities can be de- 
rived without any assumptions, except for requiring that the forcing spectrum is confined 
to a finite interval of wavenumbers, which can even be relaxed if necessary. No assump- 
tions on the existence of inertial ranges are necessary, which means that the inequalities 
are also valid in situations where the inertial ranges fail to exist. For the case of decaying 
turbulence, it is necessary to make an assumption concerning the time derivative of the 
energy spectrum, but given that assumption the same inequalities continue to hold. We 
believe that the reason why it is necessary to make an assumption for the decaying case 
is to weed out unusual initial conditions that might temporarily reverse the direction of 
fluxes. In any event, the assumption involved is somewhat weaker than the assumption 
used by Rhines. 

The paper is organized as follows. In section 2 we review the mathematical properties of 
the generalized one layer model. The flux inequalities are proven for the forced-dissipative 
case in section 3. The implications for two-dimensional turbulence are discussed in sec- 
tion 4, and for models of quasi-geostrophic turbulence in section 5. A proof of the flux 
inequalities for the decaying case is given in section 6, and a review of the proof by 
Rhines in section 7. The paper is concluded in section 8. Appendix A reviews the Holder 
inequalities, used in our discussion of the Rhines proof. 



2. Preliminaries 

We shall first present the general case of the one-layer advection-diffusion model which 
encompasses 2D turbulence, CHM turbulence, and SQG turbulence, before considering 
the subcases separately. The governing equation of these systems has the distinguishing 
form of a conservation law for a vorticity-like quantity (: 

^ + J(V, C) = -K-A) p + ^i(-A)-' l K + F, (2.1) 

where ip(x,y,t) is the streamfunction and ( is related to it through a linear operator L 
by C = —Zip. We assume that £ is a diagonal operator in Fourier space whose Fourier 
transform L(k) satisfies L(k) > and L'(k) > 0. The Jacobian term J(tpX) describes 
the advection of ( by ip, and is defined as 

da db db da 
dx dy dx dy ' 

We have written the dissipation of £ in a more general form than normally used. Our 
proof does not depend on the details of the operator T> = v(—/S) p + u\{— A)~ h , only 
that it is a positive operator. F is the forcing function; v is the hyperdiffusion coefficient; 
v\ is the hypodiffusion coefficient. The physical case of molecular diffusion and Ekman 
damping corresponds to p = 1 and h = 0. 

It can be shown that if a and b satisfy a homogeneous (Dirichlet or Neumann) boundary 
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condition, then ((J(a, &))) = 0, where we use the notation ((/)) = JJ (f(x,y))dxdy. It 
follows from the product rule that 

((J(ab, c)» = ((aJ(b, c)}} + ((bJ(a, c)}} = 0, (2.3) 

from which we obtain the identity 

((aJ(b, c)}} = ((bJ(c, a))) = {(cJ(a, b))), (2.4) 

which was also shown previously bv iTran fc Shepherd! l|2004j) . We assume that the op- 
erator L is self-adjoint in the sense that it satisfies ((/(£•<?)}} = ((<?(£/))) for any fields 
f(x,y) and g(x,y). This is true, if we assume that £> is diagonal in Fourier space. 

The conservation law d(/dt + J(-0,£) = conserves the "enstrophy"-like quadratic 
B = (l/2)((£ 2 )} for any arbitrary linear operator L, because 

(0)) = ((CO) = <(-C J{«l>, C)» - «-<M(C, C)» = o. (2.5) 

For self-adjoint operators L, the "energy" -like quadratic A = (1/2)|| — r/>£|| is also con- 
served. To show that, note that 

{(A)) = (i/2)«-^c - C# = (i/2)[(W^,C))) + ((C^-V^C)))] (2.6) 

= (1/2)[((^J(V,0)) + (v^O^C)}}] (2-7) 
= «<W»C)» = («JW>,il>))) =0. (2.8) 

Let ^4(fc) and -B(fc) be the spectral density of A and -B, respectively such that A — 
J +o ° A(k) dk and -B = Jq + °° -B(fc) dfc, and is the isotropic 2D wavenumber. The spectral 
equations are obtained by differentiating A(k) and B(k) with respect to t, and employing 
the Fourier transform of the governing equation (|2.1() : 



dA(k) dn A {k) 

dt + dk 
dB{k) d!L B (k) 
dt + dk 



-D A (k)+F A {k) (2.9) 
-D B (k)+F B (k). (2.10) 



It is understood that ensemble averages have been taken in the above quantities. Here 
ITi(fc) is the spectral density of A transfered from (0, fc) to (fc,+oo) per unit time by 
the nonlinear term in Q2.1|) . D A {k) the dissipation of A, and F A {k) the forcing spectrum 
of A, and likewise for the B equation. The conservation laws imply for the viscous case 
that ILa(0) = linifc^oo 11^ (fc) = and 11^(0) = lim^oo ITs(fc) = 0. For the inviscid case, 
this condition can be violated, in principle, by anomalous dissipation for solutions that 
have singularities. The spectra of A and B are related as B(k) — L(k)A(k), and likewise 
it is easy to show, from the diagonal structure of the £ operator in Fourier space, that 
D B (k) = L(k)D A (k) and F B (k) = L{k)F A (k). C ombining th ese equations with 12.11)1 1 



and H2.9fl we obtain the so-called Leith constraint l)Leithlll96 



ig tri es 
1968): 



-^}-= m ^k-> (2 ' U) 



which shows that if Tl B (k) is strictly constant, then Tl A (k) is also strictly constant and 
vice versa. 
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3. Flux inequalities for the forced-dissipative case 

Assume that the forcing spectrum F A (k) is confined to a narrow interval of wavenum- 
bers [fci,^]. Then, we have 

F A (k) = and F B (fc) = 0,Vfc e (0,ki) U (fc 2 ,+oo), (3.1) 

and we can show, without making any ad hoc assumptions, that under stationarity, the 
fluxes n y i(fc) and H B (k) will satisfy the inequalities 

f L'(q)U A {q) dq < 0, Vfc > k 2 (3.2) 
Jo 

J JT^ u B(q)>0,Vk<k 1 . (3.3) 

The n^fc) inequality is shown as follows: Integrating (|2.9[) and l|2.1L)ll over the (k, +oo) 
interval and employing the stationarity conditions dA(k)/dt — and dB(k)/dt — gives: 

p + OO 

U A (k) = J [Da (q) - F A (q)} dq (3.4) 

r+oc r + co 

n B (/c) = / [D B (q) - F B (q)} dq = / L(q)[D A (q) - F A (q)] dq. (3.5) 
Jk Jk 



Using integration by parts, and the Leith constraint, we have the relation 

/,, dq 



= L(k)U A (k) - f L'(q)U A (q) dq, 
Jo 



(3.7) 



from which we obtain the inequality itself 

/ L'(q)U A (q)dq = L(k)U A (k)-Il B (k) (3.8) 
Jo 

r+co 

= / [L{k) - L(q)][D A (q) - F A (q)] dq (3.9) 
Jk 

< 0, Vfc 6 (fc 2 ,+oo). (3.10) 

Here we use L(q) — L(k) > 0, Vg G (fc, +oo), and D A (q) — F A (q) > which follows from 
> and F A (q) = 0,Vq > fc > fc 2 . 

The counterpart inequality for the flux n^fc) can be derived similarly. We begin by 
integrating l|2.9|l and (|2.1U|) . but this time over the (0, k) interval: 

U A (k) = - I [D A (q) - F A (q)} dq (3.11) 



U B (k) = - / [D B (q) - F B (q)} dq = - L(q)[D A (q) - F A (q)} dq. (3.12) 
Jo Jo 

Similarly, to avoid the singularity at q = 0, we do the integration by parts over the 
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(k, +00) interval: 



n f +00 9nMMl f +oc 1 dn B (g) 

nA(k)= -J k -^r dq ^-J k W)^r dq (3 - 13) 

n * (fc) r*" L 'Mn Biq)dq , (3,4) 



L(k) J k [L(q)}' 
and consequently, we obtain 

+ °° L'(q) L(k)Tl A (k)-n B (k) 

[LW B(q) = W) ( } 

= IJk) I [L(k) ~ l ^ Da ^ - Fa ^ dq ^ 

> 0, Vfc e (0,fci). (3.17) 

Here, the inequality changes direction, because L(k) — L(q) > 0, Vq < k. 
Note that both proofs are based on the inequality 

L(k)IL A (k) - n B (k) < 0, Vfc G (0, fci) U (fc 2 , +00), (3.18) 

which holds both upscale and downscale of the forcing range in t he forced-dissipative 
case d iscussed here. We called this inequality, in a previous paper ijGkioulekas fc Tund 
the "Danilov inequality" because it was communicated to us by Danilov. It is 
worth noti ng; that this in equality is the flux analog of a similar but distinct inequality 
derived hv lFiortofd ()lfl53h in terms of the ener gy spectrum and the enstrophy spectrum. 
Finally, similar flux inequalities were known to lEvinkl l|199fil . Eq. I|3.18|l is a sharper and 
more general variation of these previous results. 

Also note that, for k < k\, since DA(k) > for all fc, it follows immediately from the 
steady state version of l|2.9|l and the assumption l|3.1|l that 



U A (k) = - I D A (q)dq<0,Vke(0,k 1 ). 
Jo 



(3.19) 



In general, one can easily show, for the forced-dissipative case where the forcing spectrum 
obeys (13.1(1 . under statistical equilibrium, that 

n A (k) > and n B (fc) > 0, Vfc G (fc 2 , +00) (3.20) 
n A (fc) < and n B (fc) < 0, Vfc G (0, fci). (3.21) 

It follows that, contrary to some popular misconceptions, both fluxes go downscale on 
the downscale side of injection, and upscale on the upscale side of injection. 



4. Implications for two-dimensional turbulence 

For the case of 2D turbulence, A(k) is the energy spectrum E(k), B(k) is the enstrophy 
spectrum G(fc) and L(k) = fc 2 . The inequality 1(3.2(1 simplifies to: 

fc 

2qU E {q) dq < 0, Vfc £ (fc 2 , +00). (4.1) 

This integral constraint implies that energy fluxes upscale in the net. The constraint 1(4.1(1 
also holds trivially for fc < fci , since 11,4 (fc) < for all fc < fci . For fc > fc 2 , the integration 
range also includes the energy injection interval [fci,fc 2 ] and both the upscale cascade 
range and the downscale cascade range. The inequality 1(4.1(1 implies that the negative 
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flux in the (0, fei) interval is more intense than the positive flux in the (fe, +00) because 
the weighted average of IIe(A;) gives more weight to the large wavenumbers. 
Similarly, l|3.3|) reduces to 



which is a statement that enstrophy fluxes downscale in the net. 

The inequalities (|4.1II and l|4.2[l are the two main results in 2D turbulence we were 
looking for, and they constitute proofs that in forced-dissipative 2D turbulence under 
statistical equilibrium energy predominantly is transferred upscale while enstrophy down- 
scale. 

To understand the implications of these inequalities on two-dimensional turbulence we 
have to distinguish between the following cases and consider them separately: 

(a) No infrared sink of energy, finite box: This is the case considered bv lTran fc Shepherc 

The coefficient of hypoviscosity, which provides the sink at the large scales, is 
zero. i.e. V\ — 0. The only dissipation mechanism is a very small molecular viscosity v, 
with p = 1. Our result of net energy cascade (|4.1|) still holds. However, without a sink of 
energy at large scales, the energy which is fluxed upscale piles up unti l it is dissipated by 
the small viscosity at the forcing scale l|Tran fc Bowmar] l2003. 2004§. No inertial range 
exists where the fluxes of energy and enstrophy are constant. Nevertheless, (|4.1|) implies 
that there is more energy flux dissipated on the upscale side of the forcing range than 
on the downscale side of the forcing range, and likewise (|4.2|l implies that there is more 
enstrophy dissipated on downscale side of the forcing range than on the upscale side of 
the forcing range . 

(b) No infrared sink of energy, infinite box: Same as in case (a) exce pt that the domai n 
is infinite. T his is the classical case of 2D turbulence considered by iKraichnanl l|l967|) , 
iLeithl 1^)68), and iBatcheloil l|l969l) . Although there is no infrared sink of energy, the 
energy cascaded upscale can keep on cascading to ever larger scales. There is no pile up 
of energy, but there is always a spectral region at larger and larger scales where steady 
state cannot be achieved. Let this region be denoted by < k < ko(t). Quasi-steady state 
can be achieved for k > ko. In this latter spectral region, our inequalities l|4.1l) and l|4.2|l 
do hold. Since energy transferred upscale through k is "lost" to the region downscale 
from fc , the infinite domain acts in effect like a perfect infrared sink. Furthermore, in the 
original formulation of the KLB theory, the molecular viscosity coefficient v is taken to 
v — > + , with the result that the energy dissipated at the ultraviolet end of the spectrum 
vanishes in the limit. In this configuration, all injected energy is transferred upscale and 
all injected enstroph y is transferred downscale.Th ese results for the KLB theory have 
been summarized bv lGkioulekas fc Tund l|2005fol(J ). 

(c) Finite infrared and ultraviolet sinks of energy: When there is a finite infrared sink 
of energy upscale of injection and a finite ultraviolet sink of energy downscale of injection, 
there is in general both an upsc ale an d a downscale flux o f energy. This situation has 
been considered in lEvinkl <|l996ft and iGkioulekas fc Tund (|20056lld) . The upscale flux 
should be larger than the downscale flux, according to (|4.ip . It should be noted that, 
because of the inequality l|3.18[l , the contribution of downscale energy flux to the energy 
spectrum in the inertial range on the downscale side of injection is always subleading 
and hidden. This is not tru e in some baroclinic cases of QG turbulence l|Gkioulekasl l2006: 




(4.2) 



lOkiouleka's fc T^OOSrJ). 
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5. Implications for models of QG turbulence 

As derived bv lCharnevI 1119711) . QG turbulence conserves two quantities, total energy, 
which consists of horizontal components of kinetic energy plus available potential energy, 
and potential enstrophy. We now discuss briefly the implications of the flux inequalities 
on one-layer and two-layer simplifications of the quasi-geostrophic turbulence model. 

(a) CUM turbulence: This model is a two-dimensional version of the quasi-geostrophic 
model, and represents physically two-dimensional turbulence on a rotating frame of ref- 
erence. The governing equation is (|2.1|l with L(k) = k 2 + A 2 , where A is the defor- 
mation wavenumber. The total energy E and total potential enstrophy G are given by 
E = (1/2)|||VV>| 2 + A 2 |?A| 2 || and G = (1/2) | CI 2 - The flux inequalities are 



f 2qn E (q) dq < 0, Vfc G (fc 2 , +oo) 
Jo 



(5.1) 



r 2 ^ 2 ^ n G ( g ) dq > o, Vfc e (o, fci), (5.2) 
k (q + * r 

and they still imply that the total energy is mainly transferred upscale whereas the 
potential enstrophy is mainly transfer downscale. 

(b) SQG turbulence: This model can be derived from the quasi-geostrophic model by 
assuming that the potential vorticity is zero over the entire three-dimensional domain. 
Then, it can be shown that the behaviour of the entire system is coupled to its behaviour 
in the boundary condition at the layer z = l|Tung fe Orlandol 12003 b\ . At z = 0, the 
potential temperature O is governed by l|2.1|l with L(k) = fc, where = £. The conserved 
quadratic B represents the total energy E^d of the system at the layer 2 = 0, whereas 
the quadratic A is the total energy E?n inte grated over the whole domain z 6 (0, +oo) 
I Gkioulekasl200f)tlOkioulekas fc Tungl2005flj) . In this system, there is no enstrophy, since 



the potential vorticity has been taken equal to zero, and consequently there is no enstro- 
phy cascade. The flux inequalities are 

/•+00 

/ q- 2 n E . 2D (q) dq > 0, Vfc G (0, fci) (5.3) 
Jk 

[ U E3D (q) dq < 0, Vfc G (fc 2 , +oo), (5.4) 
Jo 

and they imply that downscale from injection the dominant process is a downscale energy 
cascade at the layer z = 0. Upscale from injection the energy spectrum is dominated by 
an inverse energy cascade of the total energy over the entire domain. It should be noted 
that just as in two-dimensional turbulence, a dissipation sink is probably needed both 
upscale and downscale of injection to allow either cascade to form successfully. 

(c) 2-layer model of QG turbulence: This model consists of two symmetrically coupled 
layers of two-di mensional turbulence whe re the deformation wavenumber A is the cou- 
pling constant llSalmonlll97^ Il98fl 11998ft . For the general baroclinic case, specifically 
with Ekman damping only in the l ower layer, Danilqv ' s inequality 113. 1811 does not neces- 
sarily hold for 2-layer models fsee iGkioulekal l|2006|) : iGkioulekas fc Tund l|2005a|) 'l- We 
therefore do not ha ve a conclusive proof for t he case of 2-layer models. However, numer- 
ical results (see e.g. lTung fc Orlandol 1)200301) 1 show that most of the energy will still go 
upscale in this system, although some small fraction goes downscale. In particular, the 
upscale energy cascade in the inertial range upscale of injection is much larger than the 
downscale flux of energy in the inertial range downscale of injection. 
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6. Flux inequalities for the time-dependent case 

We now generalize the proof to time-dependent cases. Since (|3.8|) and (|3.15|) are math- 
ematical identities, they hold whether or not the quantities involved are time-dependent. 

J L'{q)U A (q) dq = L(k)U A (k) - U B (k), Vfc G (k 2 , +00) (6.1) 
L ' {q \n B{ q) = _mU A (k) ; U B (k)^ Vfc e 



[L{q)f a ™ L(k) 
Equations H^.lt)(l and (|3.17|) . however, should be modified to: 

L(fc)n A (fc)-n fl (A) = J + {L(k)-L(q)}[D A (q)-F A (q) + ^^}dq,Vke(0,k 1 ) 

(6.3) 

L(k)n A (k) -U B (k) = -J [L(k) - L(q)][D A (q) - F A (q) + dq, Vfc G (k 2 , +00). 

(6.4) 

Choosing A: to be outside the forcing range [fci , fc 2 ] , and combining the previous four 
equations we obtain: 

J L'(q)U A (q)dq = J + [L(k) - L{q)][D A {q) + dq, Vfc G (fe 2) +oo) (6.5) 

The equations l|6.5|l and (|6.6|) together are a general and remarkable result, because they 
relate the weighted mean of flux of A in (0, fc) to what happens outside this range, and 
the weighted mean of flux of B in (fc, +00) to what happens outside (fc, +00). 

(a) Initial stage: During the initial development, nonlinear interactions transfer energy 
from one wavenumber to another. If the initial condition Ao(k) for A{k) is of compact 
support (which is almost always the case in reality) then we can expect that during the 
initial stages of decay where A is still in the process of spreading there will be a small 
wavenumber e\ > and a large wavenumber e 2 > such that 

Ao(k) = and > 0, Vfc G (0, e x ) U (e 2 , +00). (6.7) 

Combining this condition with l|6.5|l and (|6.6|l . it follows that: 

j L'(q)n A (q) dq < £ [L(k) - L{q)]^^- dq < 0, Vfc G (s 2 , +00) (6.8) 
1 (</) n B (q) > J- [ [L(k) - L(q)}^^ > dq, Vfc G (0, £l ). (6.9) 



[L{q)f L(k)J L v ' WJ dt 

Note that each of the two previous inequalities uses only part of the assumption, i.e 
dA(k) rk 



> 0, Vfc G (e 2 , +00) => f L'(q)n A (q) dq < 0, Vfc G (e 2 , +00) (6.10) 
Jo 

^ > 0, Vfc G (0,e x ) => f °° 7 ^%n B (g) > 0, Vfc G (0, £l ). (6.11) 

Jk i L (q)r 
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Furthermore, for t = 0, the assumption 

A (k) = 0, Vfc e (0, Ex) U (e 2 , +oo), (6.12) 

implies that 

<9/l(fc) 



>0, Vfce (0,ei)U(e 2 ,+oo), (6.13) 

t=o 

from the positivity of A(k). However for t > the latter is an additional hypothesis. 

(b) Intermediate stage: In the intermediate stage, nonlinear spreading and dissipation 
are both active at the small scales. Nonlinear transfer still supplies some A to small and 
large scales by spreading. Therefore 

D A (q) + > 0, Vfc G (0, 6l ) U (e 2 , +oo), (6.14) 

at 



k 



and so from l|6.5|l and l|6.6|l 'l we again obtain 
D A (q) + > 0, Vfc e (e a , +oo) => J L'(q)Il A (q) dq < 0, Vfc e (e a , +oo) (6.15) 

Aifa) + ^ °> Vfc e =^ / + °° [r§p ns(<z) - °' Vfc e (0 ' £i) - (6 - 16) 

(c) Final decaying stage: In the final stages of unforced turbulence, A decays due to 
dissipation. The decay rate of A(k) is the same as the dissipation rate. Therefore, 

D A (q) + = 0, Vfc e (0, ei ) U (e 2 , +oo), (6.17) 

at 

and consequently, 

/ L'(q)U A (q)dq = 0,Vk€(e 2 ,+oo) (6.18) 
Jo 

/•+oo r//^ 1 ) 

/ ^HLu B (q) =0, Vfce (0,ei). (6.19) 

We do not have upscale cascade. During this final stage, nonlinear spreading has already 
occurred, and dissipation of energy dominates. Nevertheless, this still implies that A is 
transferred in the net upscale and B in the net downscale. 

The implication of these results is that net energy flux is directed in the net upscale for 
the time-dependent case of 2D and barotropic QG turbulence in the absence of forcing 
if the initial condition is of compact support and if it is assumed that it subsequently 
spreads into small scales. For SQG turbulence the result is reversed, in the sense that 
energy in the z — layer is transferred downscale in the net. 



7. Remarks on Rhines proof 

Rhines starts with the assumption 

d 

diJ Q 

where K = E\/Eq is the first moment of E(k) and E a is defined as 



(fc - K) 2 E(k) dk > 0, (7.1) 



E a = I k a E(k) dk. (7.2) 
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Here Ij7.1|l is a "postula te that the peak will spread in time" from its current centre of 
mass if lRhinesll|l975^ not necessarily in particular realizations, but in a probabilistic 
sense where an ensemble average over all initial conditions, constrained by the initial 
energy spectrum, has been taken. Rhines then shows that 

which means that the average location of the peak tends to move toward smaller wavenum- 
bers, and conclude s from this that the energy has a tenden cy to be transfered upscale. 

the details of the proof are not given. In lRhinesI (^)79), the following 
more detailed argument is given which is correct for the inviscid case v = and v\ = 0: 
Expanding 

r+ca 

/ (k-K) 2 E(k)dk = E 2 -2KE 1 +K 2 E Q = E 2 -K 2 E a , (7.4) 
Jo 

and solving for K 2 , we obtain 

r+oo 

E K 2 =E 2 - (k- K) 2 E{k) dk. (7.5) 
Jo 

Differentiating with respect to t, and writing E' a — dE a /dt, we have, 

dK 2 d f + °° 

E' K 2 +E a —=E' 2 -±J o (k- K) 2 E{k) dk<E> 2 , (7.6) 

which gives, 

dK 2 E' 2 - E K 2 

~df < -*~E?— (7 - 7) 
If we assume Eq = and E' 2 = 0, which can be deduced from conservation of energy and 
cnstrophy for the case where there are no viscosities, then it follows that 

dK 2 , x 

— < 0. (7.8) 

However, Rhines' argumen t was s upposed to work for the viscous case as well (see page 
405, last equation, of lRhmesI (|l979ln where E' < and E 2 < 0. It appears that the term 
EqK 2 in (|7.6(l was ignored in that derivation. If the term is included, then the right hand 
side of (|7.7|l has two ter ms of opposi te sign, and it is not immediately clear which term 
dominates. Nevertheless. IScottl ()2001^ showed, using the Holder inequality, that the proof 
can still be completed, for the case of Ekman damping and molecular diffusion [h = 
and p = 1). 

As it stands, this proof is interesting, but it cannot be extended to the forced-dissipative 
case because it relies on describing the behavior of time-derivatives of the energy spectrum 
rather than fluxes. Furthermore, it relies on the assumption l|7.1|l . without proof. The 
difference between the assumption (I7.1|l and the assumption used in our proof is that, 
H7.1(l is a global condition stated over the entire range of wavenumbers, whereas the 
assumption needed for our proof in the previous sections is a local condition over the 
intervals (0, £i)U(e2, +oo). We suspect that the need to make some assumption for proofs 
covering the decaying case is unavoidable because it is necessary to weed out unusual 
initial conditions. 

It should be noted that the Rhi nes proof given by Salmon! l|l998|) is different from the 



proof given in the original papers l|Rhineslll975l 1197? ). The difference is that in 1)7.1(1 K, 



which is time dependent, is replaced with a constant wavenumber ki representing the 
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initial positio n of the peak. This modified proof was extended to the general case of a- 
turbulence bv lSmith. Boccaleti. Henning. Marinov. Tarn. Held & Vallia <|2002t) . However, 
we feel that the original assumption j7J is more reasonable, on physical grounds, and 
there is no benefit in modifying 1)7. lfl . 

Furth ermore, it sh ould be stressed that there is an important difference between the 
proof of IScottl l)200l|) and the original Rhines proof. The main difference is that Rhines 
assumes that the unnormalized variance of the energy spectrum is increasing with time 
fEa. H7.lfl ') whereas IScottl 1)200 lj) assumes that the normalized variance o\ is increasing. 
The definition of a\ is 

2 _ J + °°(k-K) 2 E(k)dk 

aE = srmdk • (7 - 9) 

Because the denominator of a 2 E is decreasing with time, it is easy to see that the as- 
sumption da 2 E jdt > is mathematically weaker than the assumption l|7.1[l used in the 
original formulation of the Rhines proof. Consequently, since it is shown to be possible to 
arrive to the same conclusion under a weaker assumption, th e statement proved by IScottl 
l)200l|) is better than the statement claimed by R hines. Thus. l Scottl l|200lj) implicitly also 
rehabilitates the original Rhines proof. However. IScottl l)200l[) did not consider the case 
of hyperdiffusion and hypodiffusion in his paper. 

For the more general case of hyperdiffusion and hypodiffusion, from the conservation 
laws, we find that E' and E' 2 read: 

E' = -2vE 2p - 2v x E- 2h (7.10) 
E' 2 = -2uE 2p+2 - 2z/i£_ 2ft+2 , (7.11) 

and the time-derivative of K 2 is now bound by 

dK 2 < E' 2 - E Q K 2 = 2v{K 2 E 2p - E 2p+2 ) + 2v x {K 2 E_ 2h - E_ 2h+2 ) 

dt Eq Eq 

2v (E\E 2v \ 2v x ( E\E_ 2h \ 

~ Eq E2p+2 J + Eq- {~^f~ E - 2h+2 I (,A " ] 

2 p_ p2 i 



2v E 1 E 2p — E Q E 2p+2 2vi E x E_ 2h - E E_ 2h+2 
E Q El Eq El 



(7.14) 



The first term is again negative because 

ElE 2p+2 — (E E 2p+2 )Eq > (E 2 E 2p )E = E 2p (E E 2 ) > E 2p E 2 , (7.15) 

for all p > 0. Here, we employ the inequality E\ < EqE 2 , and the theorem that the 
function £(k, a) = E K+a /E K is an increasing function with respect to k for a > (see 
appendix which implies that EqE 2p+2 > E 2 E 2p for all real p > 0. For h — 0, the 
second term is negative too. To see this, note that the numerator of that term reads: 

E\Eq - E 2 E 2 = E (E 2 - EqE 2 ) < 0. (7.16) 

However, so far as we know, the sign of the second term is indeterminate when < h < 
1/2 and can be shown to be positive when h > 1/2. To show this, note that: 

ElE- 2 h+2 < E E_ 2 h+iE_ 2 h+2 _ EoE- 2 h + i ^ -E-2/1-E-2/1+1 _ ^ ,y ^ 

E 1 E-2h E\E-ih+2E-2h E\E-2h E-2h + \E-2h 

Here we use Eq/E\ < E-2h+i/E-2h+2, which is valid for h > 1/2 and Eq/E\ < 
E-2h/E-2h+i, which is valid for ft, > 0. It follows that 

E\E-2h - ElE-2h+2 > 0, for h > 1/2. (7.18) 
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Thus, the validity of the Rhines proof continues for the case of hyperdiffusion (p > 1), 
but cannot be extended to the case of hypodiffusion (h > 0). 

It i s interesting to note that when one begins with the weaker hypothesis of IScottl 
it can be shown that the contribution from the hypodiffusion term is always 
positive for h > 0. In the argument above we ca nnot show this unless h > 1/2. To 
see this, we retrace the argument of IScottl l|2f)f)ll) for the case of hyperdiffusion and 
hypodiffusion: 



dK 2 _ d (E 2 \ d(7% _ E' 2 E Q -E 2 E' da% 



dt dt \E„J dt E$ dt 



(7.19) 



"7v5-(-E'2p+2-E ; o — E 2 E 2p ) — — j(E- 2 h+ 2 E — E 2 E- 2 h) -p. (7.20) 

E E dt 



The first term is negative because E 2p+2 E > E 2 E 2p , for all p > 0. As IScot tl l)200l|) 
noted, the second term vanishes for h = 0, however it is positive for h > 0. Thus, for 
h > 0, the sign of dK 2 j dt remains indeterminate. 



8. Concluding remarks 

We have shown two inequalities (|3.2|) and l|3.3fl , which for the case of two-dimensional 
turbulence imply that the weighted-average of the energy flux is negative and the weighted- 
average of the enstrophy flux is positive. This implies that the energy tends to go upscale 
in the net and the enstrophy tends to go downscale in the net. For the forced-dissipative 
case, the inequalities can be derived without any ad hoc assumptions. For the decay- 
ing case, a sufficient condition for the energy inequality is to assume that there exists 
a very large wavenumber k such that over the interval (fc, +00) the energy spectrum 
is increasing or constant. Likewise, for the enstrophy inequality it is sufficient that we 
assume that there exists a very small wavenumber k such that over the interval (0, k) 
the energy spectrum is also increasing or constant. From a physical point of view, these 
assumptions are slightly more plausible than the assumption l|7.1|l made by Rhines in 
his proof. It should be noted that unlike previous proofs in both the forced-dissipative 
and the decaying case, the inequalities have the same mathematical form. Our argument 
then is a unified proof that covers all cases, and specialized results can be deduced from 
our inequalities for special cases. We have also briefly discussed the implications of our 
results for one-layer and two-layer models of quasi-geostrophic turbulence. 

Note that none of the results obtained in this paper forbids energy from being trans- 
ferred downscale even when it is shown that the net flux should be directed upscale; they 
merely say that in those cases the energy going upscale in the upscale range should be 
larger than that going downs cale in the downscale range. In fact, for the case of finite 
domains with finite viscosity. iGkioulekas fc Tu ng (2005k. c) showed that the downscale 
flux of energy on the s hort-wave side of injectio n must be nonzero. Even in the case 
of 2-layer model where iTung fc Orlando! (2003(3) found in their numerical experiment 
that the downscale energy flux over the mesoscales contributes visibly to the observed 
energy spectrum, it is still true that there is a larger inverse energy cascade from the 
synoptic to the planetary scales. The exception is the case of Surface QG turbulence, 
where most of the energy at the z = layer goes downscale, as shown here. We sus- 
pect that this may be due to the collapse of temperature gradients on solid surfaces (a 
model of frontogenesis), and differs from the turbulence in th e free atmosphere. In the 
free troposphere, there i s stro ng observational evidence (e.g. iBoer fc Sheoherdl l|l983f) 
and lStraus fc Ditlevsenl l(l999h ) that energy flux is negative (upscale) from synoptic to 



i 
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lanetary scales, and the positive (downscale) flux over t he mesoscales ( Cho fc Lindborel 



200ll ICho. Newell. Anderson. Barrick fc ThornhiJl2003t) is small by comparison 
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Appendix A. Holder inequalities 

Let f(x) and g(x) be two functions defined over a domain x £ A, such that 

f(x) > and g(x) > 0, Vx £ A, (A 1) 

and let a, b be real numbers such that (1/a) + (1/6) = 1. Then the Holder inequalities, 
in the integral form, read 

f(x)g(x)dx< ( [ [f(x)] a dx] ' ( [ [g(x)} b dx] ' . (A 2) 



A \JA 



For the case a = b = 2 and A = (0, +00) with f(x) = k a ^E(k) and g(x) = k - s /E(k), 
we have 

r+00 

E a+f3 = / k a+ ^E(k)dk (A3) 
Jo 

<lj (k a y/Ejk)) 2 dkj (J {yJkPE(k)) 2 dk) = ^E 2a E 20 . (A 4) 



For the cases (a, 0) = (0, 1) and (a, 0) = (0, 2) we obtain E\ < E E 2 and E\ < E E 4 by 
raising squares, noting that all the quantities involved are positive. For the case a 1— ► k 
and f3 1 — > k -\- loc we get the inequality E 2 +a < E K E K+2a which can be rewritten as 

< E * +2a . (as) 

E K E K + a 

This inequality appears to indicate that the function £(k, a) = E K+a /E K is an increasing 
function with respect to K for a > and decreasing for a < 0. To prove this, we first 
note that from Ea. (|A 5|) we have, for any integer n > 

TP n TP 

£{K + a/n,a) = — = |h (A 6) 



> 



' = __ = £(«,«). (A 7) 



It follows that since £(k, a) is a differentiable function with respect to k, that 9£(k, a) /9k > 
for a > and <9£(k, a) /Ok < for a < 0, and thus £(k, a) is an increasing function 
with respect to K for a > and a decreasing function for a < 0. All the inequalities 
needed for our discussion of Rhincs proof can be deduced from this result. 
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